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Abstract 

Spatial analytical techniques and models are often used in epidemiology to identify spatial anomalies 
(hotspots) in disease regions. These analytical approaches can be used to identify not only the location of 
such hotspots, but also their spatial patterns. We used spatial autocorrelation methodologies, including 
Global Moran's I and Local Getis-Ord statistics, to describe and map spatial clusters and areas in which 
nine malignant neoplasms are situated in Taiwan. In addition, we used a logistic regression model to test 
the characteristics of similarity and dissimilarity between males and females and to formulate the common 
spatial risk. The mean found by local spatial autocorrelation analysis was used to identify spatial cluster 
patterns. We found a significant relationship between the leading malignant neoplasms and well- 
documented spatial risk factors. For instance, in Taiwan, the geographic distribution of clusters where oral 
cavity cancer in males is prevalent was closely correspond to the locations in central Taiwan with serious 
metal pollution. In females, clusters of oral cavity cancer were closely related with aboriginal townships in 
eastern Taiwan, where cigarette smoking, alcohol drinking, and betel nut chewing are commonplace. The 
difference between males and females in the spatial distributions was stark. Furthermore, areas with a 
high morbidity of gastric cancer were clustered in aboriginal townships where the occurrence of 
Helicobacter pylori is frequent. Our results revealed a similarity between both males and females in spatial 
pattern. Cluster mapping clarified the spatial aspects of both internal and external correlations for the nine 
malignant neoplasms. In addition, using a method of logistic regression also enabled us to find 
differentiation between gender-specific spatial patterns. 

Key words Spatial autocorrelation calculations, logistic regression, age-adjusted rate, malignant neoplasm, 
Taiwan 



The study of spatially related objects or features 
involves first describing the regional characteristics that 
differentiate areas from one another before proceeding 
with an analysis of spatial interrelations Common 
spatial techniques used in health research include 
disease mapping, clustering techniques, diffusion 
studies, identification of risk factors through map 
comparisons, and regression analysis'^. Spatial clustering 
techniques are essential for statistical consideration and 
form the starting approach in the development of models 
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for predicting disease risk sites. Disease risk sites are 
areas located near one another, with a tendency to share 
similar disease risk factors because they apportion 
similar environments and are also often connected by the 
spread of communicable diseases via vectors or host 
dispersal'^'. 

Cuzick et al. ™ proposed three general criteria that 
can be utilized for cluster detection: (1) cell counts, (2) 
autocorrelative adjacencies of cells with high counts, and 
(3) distance between events. Numerical methods have 
been adopted extensively for spatial cluster detection in 
health research and epidemiology, especially for 
processing areal data. The analytical approaches include 
the following: join-count statistics'^', Ohno statistics 
Poisson statistics!^. Global Geary's C"^™', Global Moran's 
\^'^"\ General Getis-Ord 's G ™, Local Moran's |["i, and 
Local Gi{cl} and G/'*(cf)'""^''. Spatial autocorrelation statistics, 
such as Moran's I and Geary's C methods, are global 
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in the sense that they estimate the overall degree of 
spatial autocorrelation in a dataset. The possibility of 
spatial heterogeneity suggests that the estimated degree 
of autocorrelation may vary across geographical space 
significantly. Local spatial autocorrelation statistics 
provide estimates disaggregated to the level of the 
spatial analysis units, allowing assessment of 
dependency relationships in different areas. Local Gi(d) 
and G/*(y) statistics can be used to conduct autocorrelation 
comparisons in different neighborhoods. A global 
average is used to help identify local regions of strong 
autocorrelation. A local version of the Moran's I and 
Geary's C statistics is also available. 

This study applies spatial autocorrelation methods to 
medical care data collected by Taiwan National Health 
Insurance to determine the spatial clustering associated 
with nine malignant neoplasms and compare the 
similarity of local clusters between males and females. 
Furthermore, potential spatial risks that contribute to 
these healthcare events and redefine epidemiological 
and spatially referenced data are also investigated. 



Objects and Methods 

Study areas 

The study areas included the mainland of Taiwan 
only (excluding all islets), which in 2000 comprised more 



than 22 million inhabitants living in an area of 36 000 km^. 
A total of 349 local administrative government areas 
included five main urban areas, two secondary urban 
areas, 162 rural townships, and 54 plain and aboriginal 
mountain townships (Figure 1). According to a bulletin 
issued in 2002 from the Ministry of the Interior, urban 
areas were regions that possess at least one 
metropolitan center and included neighboring cities and 
townships that share socioeconomic activities. Main 
urban areas were defined as those with a population 
larger than one million, specifically, Taipei-Keelung, 
Kaohsiung, Taichung-Changhua, Jhongli-Taoyuan, and 
Tainan. Secondary urban areas were defined as those 
with a residential population ranging from 0.3 to 1 million 
(such as Hsinchu and Chiayi). 

Data collection and management 

The Taiwan National Health Insurance (NHI) 
program was implemented in 1995. The coverage rate of 
the program increased from 92.41% of the Taiwanese 
residential population in 1995 to more than 96.16% in 
2000. Coverage further increased to 98% after the 
inclusion of active military in 2001 . The average 
population in 2005-2009 numbered more than 22.9 
million'^^i. 

Once the NHI medical care data were properly 
collected and analyzed, a complete picture of population 
behaviors according to disease could be drawn for 
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Figure 1. Map of urban 
areas and aboriginal town- 
ships In the study areas. Map 

of the study areas was 
divided into 349 administrative 
districts, including seven urban 
areas and an integrated area 
of 54 plain and aboriginal 
mountain townships. 
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reference in the calculation of prevalence and incidence 
of various diseases. At the beginning of 2004, NHI data 
related to medical care, such as the leading causes of 
death, were reclassified and reprocessed in relation to 
smaller units or areas (e.g., precincts or townships 
rather than the country as a whole). NHI publicly 
announced regional data from the statistical analysis 
system (SAS) program in regular annual reports (e.g., 
NHI, 2005-2009 These reports are an accurate and 
reliable data source to help investigators examine 
healthcare issues in Taiwan. 

The data were collected from contractual medical 
care institutions, which in this study are institutions 
where NHI covers prescription medicinal costs and 
treatment at outpatient clinics. Such facilities accumulate 
detailed databases of medical costs for inpatient care. 
The number of outpatient cases was classified in relation 
to disease codes, as defined in the 1975 edition of The 
International Classification of Diseases, Ninth Revision, 
Clinical Modification (hereafter, ICD 9 CM). Criteria for 
refining the data were first established. However, for 
calculating medical costs and number of visits, the 
disease code of the first group (main diagnosis code) 
was used as the cause of disease. Certain data were not 
included in the final statistical data set, for instance, in 
cases where patients suffered from diseases that defied 
code classification or in cases with mismatched ID 
numbers. Disease codes were classified according to 
gender and age. Cases sharing the same ID numbers 
though referencing different diseases were counted as 
separate instances'^"^^'. 

Medical care data, such as the number of the cause 
of disease, obtained from the NHI reports between 
2005-2009 were examined, and the prevalence rates of 
the nine malignant neoplasms were calculated. Disease 
classifications were compiled according to the ICD 9 CM 
and are indicated in parentheses. They include the 
following: tracheal, bronchial, and lung cancer (ICD 162); 
liver and intrahepatic bile duct cancer (ICD 155); colon 
and rectal cancer (ICD 153, 154); stomach cancer (ICD 
151); oral cavity cancer (ICD 140, 141, 143-146, 148, 
149); esophageal cancer (ICD 150); pancreatic cancer 
(ICD 157); non-Hodgkin's lymphoma (ICD 200, 202, 
203); and leukemia (ICD 204-208). The Ministry of the 
Interior provided the mid-year population of each 
township for demographic background The smallest 
administrative units coded for this study were precincts 
and townships. The age-adjusted standard prevalence 
rates were calculated with a direct adjustment using the 
world population in 2000 as the standard population '^''i. 
The age-adjusted standard prevalence rates during 
2005-2009 were calculated according to which five-year 
prevalence rates were weighted by persons each year. 
The results showed the leading malignant neoplasms for 
males and females in both the whole of Taiwan and 
each township in the study area and were consequently 



applied to the calculation of spatial autocorrelation. 
Determining spatial weiglits/connectivity matrices 

The spatial weights matrix is a simple approach for 
representing similarities between location and attribute 
values in a region of contiguous areal objects. 
Autocorrelation was identified by the presence of 
neighboring cells or zones that take the same (binary) 
attribute value. Comparing neighboring values of spatial 
attributes by defining a weights matrix W in which each 
element Wij measures the locational similarity of ; and j (/ 
identifies the row and j the column of the matrix) was a 
common practice. A simple measure of contiguity was 
used, coding W/j = ^ if regions / and j were contiguous 
and Wij= 0 othera/ise. was set equal to 0 for all /'^'. 

The spatial weights matrix effectuated developing 
measures of spatial autocorrelation using four of the 
attribute types (nominal, ordinal, interval, and ratio). Any 
measure of spatial autocorrelation sought to compare a 
set of locational similarities (contained in a weights 
matrix) with a corresponding set of attribute similarities 
Cj, combining them into a single index in the form of a 
cross-product: T^jCaWii. 

This expression was the total obtained by multiplying 
every cell in the W matrix with its corresponding entry in 
the C matrix before summing. 

Several approaches were used to measure c„ as the 
nominal, ordinal, interval, and ratio scale. For nominal 
data, the typical approach involved setting c,j to 1 if / and 
) took the same attribute value and 0 otherwise. For 
ordinal data, similarity was based on comparing the 
ranks of / and / For interval and ratio data, the attribute 
of interest was denoted x, and the product (Xj-u)(x;-t/) 
was calculated, where u denoted the average of u}^\ 

In this study, spatial contiguity for polygons was the 
property of sharing a common boundary or vertex. 
Contiguity analysis was a significant method for assessing 
unusual features in the connectivity distribution P'^'. The 
queen's measure of contiguity was utilized to make up 
for spatial contiguity by incorporating both the rook and 
bishop relationships into a single measure The 
administrative districts considered in this study were 
highly irregular in both shape and size. Tsai ef al. 
showed that the most appropriate approach was the first 
order queen polygon contiguity method for quantifying 
the spatial weights matrix for connectivity analysis. The 
spatial weights/connectivity matrices were utilized in the 
Global Moran's I and Local G'(d) calculations described 
hereafter. The spatial weights matrix files were 
automatically produced by ArcMap 9.3. 

Global Moran ' s I statistic 

The global spatial autocorrelation statistical method 
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was used to measure the correlation between 
neighboring observations to find the patterns and the 
levels of spatial clustering among neighboring districts '^''i. 
The Moran's I statistic, similar to the Pearson correlation 
coefficient'^^!, ^vas calculated by the following formulae: 

^ (xi-u)(xj-u) 

\2 



/ = — V V w,. 



(1) 



where N is the number of districts; Wi, is the element in 
the spatial weight matrix corresponding to the 
observation pair /, j; and Xj are observations for areas / 
and ywith mean u; and 



(2) 



Because the weights were row-standardized = 1, to 
sum the row totals in the weights matrix to 1 , the first 
step in the spatial autocorrelation analysis required 
constructing a spatial weight matrix that contained 
information regarding the neighborhood structure for 
each location. Adjacency was defined as directly 
neighboring administrative districts inclusive of the district 
itself. Non-neighboring administrative districts were given 
a weight of 0. 

Local Gi*(d) statistic 

The local Gt (d) statistic (local G-statistic) was used 
to test the statistical significance of local clusters as 
related to the nine malignant neoplasms and to 
determine the spatial extent of these clusters The 
local G-statistic, which was useful for identifying 
individual members of local clusters by determining the 
spatial dependence and relative magnitude between an 
observation and neighboring observations ™, could be 
written as followsi^^'^'''^^i: 



G(d) = " ' =^ , for all j 
(n-l) 



(3) 



where x is a measure of the prevalence rate of each 
leading malignant neoplasm within a given polygon, such 
as each administrative district; is a spatial weight that 
defines neighboring administrative districts y to /; and Wi 

is the sum of the weights w,;, x = -i- ^ x . , = ^ . ■ 

x^-x" . Developing the spatial weights 1% was 
the first step for calculating Gi^(d). The spatial weight 
matrix included ia^ = 1 . The adjacency was defined by 
using a first order queen polygon continuity weight file 
that was constructed based on districts sharing common 
boundaries and vertices. 

Non-neighboring administrative districts were given a 
weight of 0. The neighbors of an administrative district 
were defined as those that share a boundary with the 



administrative district. A simple 0/1 matrix was formed, 
where 1 indicates that the municipalities share common 
border or vertex and 0 indicates the contrary i^''^'''. 

The local G-statistic included the value in the 
calculation at /. Assuming that Gi'(d) is approximately 
normally distributed t^^', the output of Gt(d) can be 
calculated as a standard normal variant with an 
associated probability from the z-score distribution ™. 
Clusters with a 95% significance level from a two-tailed 
normal distribution indicate significant spatial clustering, 
though only positively significant clusters (with a z-score 
value greater than +1 .96) are mapped. 

Logistic regression analysis 

Similarities between spatial distribution patterns for 
males and females were displayed. In addition to 
mapping, logistic regression was performed. The binary 
response indicates whether a significant autocorrelation 
exists between administrative districts or areas. A higher 
correlation is present if the absolute value of the z-score 
of the local G-statistics is larger than 1.96, and a lower 
correlation otherwise. Gender was considered an 
explanatory variable in the logistic regression model. 
Thus, the model was expressed as: 

, / PrfHieher correlation)\ „ „ 

iTTT^ TT-rr^" ^ ^'"'^^ (4) 

\ rr(Lower correlation) / v^' 

where /3o and /3o are the logistic regression coefficients of 
the model. Pr (Higher correlation) and Pr (Lower 
correlation) denote the "Higher" and "Lower" correlation 
probabilities, respectively'^^'. 

Global Moran's I statistic and the local Gt(d) 
statistic were employed and mapped using ArcMap 9.3. 
Modeling of the logistic regression was performed using 
SPSS 12.0. 



Results 

The results of the prevalence rates for the nine 
malignant neoplasms between 2005 and 2009 in Taiwan 
were summarized in Table 1. The results showed that all 
prevalence rates related to the leading malignant 
neoplasm for males were higher than those for females. 
Gender ratios, defined as the ratio of males to females, 
generally ranged from 1 to 2, though they were 
occasionally even higher than 6, e.g., for oral cavity 
cancer and esophageal cancer. 

The calculated global autocorrelation statistics for 
the nine malignant neoplasms in males and females in 
Taiwan were summarized in Table 2. The results of the 
Global Moran 's tests for most of the cases related to the 
leading malignant neoplasms were statistically significant 
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Table 1 . Association between Ubc9 expression and clinicopathologic features of breast cancer 



Leading malignant neoplasms (ICD code) 


Male' 


Female' 


Ratio (male : female) 


Tracheal, bronchial, and lung cancer (ICD 162) 


145.32 


94.49 


1.54 


Liver and intrahepatic bile duct cancer (ICD 155) 


241 .78 


122.88 


1.97 


Colon and rectal cancer (ICD 153, 154) 


260.49 


212.00 


1.23 


Stomach cancer (ICD 151 ) 


77.96 


49.68 


1.57 


Oral cavity cancer (ICD 140, 141, 143-146, 148, 149) 


169.83 


23.24 


7.31 


Esophageal cancer (ICD 150) 


35.42 


5.35 


6.62 


Pancreatic cancer (ICD 157) 


21.08 


18.02 


1.17 


Non-Hodgkin 's lymphoma (ICD 200, 202, 203) 


56.31 


45.31 


1.24 


Leukemia (ICD 204-208) 


36.85 


31.09 


1.19 



Data were collected between 2005 and 2009. ' indicates age-adjusted prevalence rates per 100 000 people. 



Table 2. Global autocorrelation analysis of data for the nine malignant neoplasms in Taiwan according to gender 



Leading malignant neoplasms (ICD code) 


Male 




Female 




Moran's index 


Z(l) 


Moran's index 


Z(l) 


Tracheal, bronchial, and lung cancer (ICD 162) 


0.51 


15.95- 


0.21 


6.55- 


Liver and intrahepatic bile duct cancer (ICD 155) 


0.56 


17.60- 


0.40 


12.77- 


Colon and rectal cancer (ICD 153, 154) 


0.55 


17.31- 


0.47 


15.39- 


Stomach cancer (ICD 151 ) 


0.43 


13.86- 


0.35 


11.16- 


Oral cavity cancer (ICD 140, 141, 143-146, 148, 149) 


0.68 


21.09- 


0.58 


18.41" 


Esophageal cancer (ICD 150) 


0.27 


8.45- 


0.23 


7.51 • 


Pancreatic cancer (ICD 157) 


0.22 


6.92- 


0.23 


7.31 • 


Non-Hodgkin 's lymphoma (ICD 200, 202, 203) 


0.07 


2.29- 


0.02 


0.75 


Leukemia (ICD 204-208) 


0.05 


1.74 


0.10 


3.55- 



Data were collected between 2005 and 2009. A value in Z(l) greater than 1.96 is statistically significant and Is Indicated by an asterisk (*). 



(z-scores greater than 1.96) and indicated spatial 
heterogeneity. However, opposing results (z-scores lower 
than 1.96) emerged in two cases of non-Hodgkin 's 
lymphoma for females and leukemia for males, 
respectively. 

The spatial clusters (hot spots) obtained from the 
local Gt(d) statistic for the nine malignant neoplasms for 
both males and females in Taiwan between 2005-2009 
are shown in Figure 2. The z-score outcomes calculated 
by the Gt(d) statistic were categorized as clusters or 
non-clusters at a 5% significance level. The nine 
malignant neoplasms in males and females were 
compared and the results were summarized in Table 3. 
No statistically significant dissimilarity (P > 0.05) was 
observed between the spatial distribution patterns for 
males and females for four of the nine spatial clusters. 
Dissimilarities were found in malignant neoplasms for 
tracheal, bronchial and lung cancer, colon and rectal 
cancer, oral cavity cancer, esophageal cancer, and 
non-Hodgkin 's lymphoma. 



Discussion 

Close locations are likely to share similar attributes. 
In other words, all variables are related, and nearby 
variables are more closely related than distant variables^^'. 
In epidemiology, a cluster becomes apparent when a 
number of health events that are situated in close 
proximity in space and/or time occur. The evaluation of 
spatial distributions as a measure of disease risk may 
provide etiological insights '^^i. Spatial autocorrelation is 
defined as the relationship between the values of a 
single variable. This relationship is attributable to the 
geographical arrangement of areal units on a map and 
can be used to identify the degree of spatial clustering'^-^'. 
In the present study, we used the local G-statistic to 
measure the degree of spatial clustering and to map the 
geographical patterns of areal units. Spatial clustering of 
the leading cause of death (also called a hot spot) was 
defined as a z-score value larger than 1.96. In 
epidemiology, hot spots are appealing because of their 
correlation to etiology. For this reason, the hot spots of 
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Figure 2. Spatial ciusters {hot spots) of the nine 
malignant neopiasms in Taiwan. Maps show the spatial 
clusters of the nine malignant neoplasms in Taiwan: 
tracheal, bronchial, and lung cancer are designated by A; 
liver and intrahepatic bile duct cancer by B; colon and 
rectal cancer by C; stomach cancer is designated by D; 
oral cavity cancer by E; and esophageal cancer by F; 
pancreatic cancer is designated by G; non-Hodgkin's 
lymphoma by H; and leukemia by I. Gender is indicated by 
a number, where male is 1 and female is 2. 



Table 3. Logistic regression modei comparisons of the nine maiignant neopiasms in Taiwan between 2005 and 2009 



Leading malignant neoplasms (ICD code) 


P 


Description 


Tracheal, bronchial, and lung cancer (ICD 162) 


<0.001 


Dissimilarity' 


Liver and intrahepatic bile duct cancer (ICD 155) 


0.253 


Similarity" 


Colon and rectal cancer (ICD 153, 154) 


0.027 


Dissimilarity' 


Stomach cancer (ICD 151 ) 


0.197 


Similarity" 


Oral cavity cancer (ICD 140, 141, 143-146, 148, 149) 


<0.001 


Dissimilarity' 


Esophageal cancer (ICD 150) 


<0.001 


Dissimilarity' 


Pancreatic cancer (ICD 157) 


0.217 


Similarity' 


Non-Hodgkin's lymphoma (ICD 200, 202, 203) 


0.009 


Dissimilarity 


Leukemia (ICD 204-208) 


0.760 


Similarity 



' indicates that both compared sides must be Moran's tested clusters. 



nine malignant neopiasms, as obtained from ttie 
analysis, were indicated, and ttieir spatial locations were 
identified. Information regarding spatial location is useful 
for detecting risl< factors from a spatial viewpoint. A more 
detailed survey of these identified tiot spots may reveal 
significant indications to ttie risk factors of these 



diseases. 

The modifiable areal unit problem (MAUP) is a 
phenomenon in which different results are obtained from 
an analysis of the same data grouped into different sets 
of areal units. Studies of the MAUP based on empirical 
data provided only limited insight into the inability to 
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control relationships between multiple spatial variables. 
Data simulation is necessary to control various properties 
of individual level data. Simulation studies, such as those 
conducted by Swift ef a/. have demonstrated that the 
spatial support of variables can affect the magnitude of 
the ecological bias caused by spatial data aggregation. 
Manley et a/.™ concluded that MAUP is not an actual 
problem, but rather, a resource. Data at different scales 
can help identify processes operating at different scales. 
Defining an ideal single census geography that captures 
all the processes for all variables is evidently impossible'^. 
Furthermore, the internal composition of the areal units 
may not be homogeneous, particularly for disease 
distribution. Matisziw ef al.^^^ suggested that downscaling 
the spatial structure of polygonal units should provide 
valuable information pertaining to the spatial distribution 
of diseases. 

The z-scores for the local G-statistic were calculated 
using the logistic regression model. The results for 
various leading malignant neoplasms and for males and 
females were compared. However, the constraint 
condition for a spatial clustering comparison, e.g., 
defined as Moran's tested clusters on both sides, should 
be satisfied before testing the logistic regression for a 
comparison. Based on this constraint condition, the 
results of the comparison showed statistically significant 
differences for tracheal, bronchial and lung cancer, colon 
and rectal cancer, oral cavity cancer, and esophageal 
cancer. However, a further three cases comparing 
cancers of the liver, intrahepatic bile ducts, and stomach 
as well as the pancreas were not significantly different. 
In other words, the null hypothesis is accepted. The 
accepted null hypothesis results indicate that the 
common spatial factor(s) may interact with both males 
and females. 

Previous ecological studies related to malignant 
neoplasms and their correlation to risk factors in Taiwan 
are scant, although oral cancer and stomach cancer 
have been documented and are discussed below in brief. 
This study of the spatial clustering of Taiwan's leading 
malignant neoplasms can act as a reference in the study 
of spatial epidemiology. 

Areas showing an elevated mortality rate from oral 
cavity cancer for females were clustered in the aboriginal 
townships in eastern Taiwan. Aboriginal women in 
eastern Taiwan had a much higher prevalence of 
cigarette smoking, alcohol drinking, and betel nut 
chewing compared to women in other areas In 
addition, Chiang ef a/.™ suggested that high-risk areas 
of oral cancer incidence in males closely coincide with 
the spatial distribution of serious metal (e.g., chromium 



and nickel) pollution in soils in central Taiwan. Oral 
cavity cancer clusters according to gender were 
calculated in this study using the local G-statistic. The 
results show clear spatial clustering in central (males) 
and eastern Taiwanese aboriginal townships (females). 
The observations thus support the results of previous 
studies. Furthermore, males with oral cavity cancer show 
another spatial clustering in southern Taiwan. Spatial 
risks might have similar causes to previous studies in 
females, e.g., cigarette smoking, alcohol drinking, and 
betel nut chewing, but further studies are required. 

Several meta-analyses have shown a strong and 
consistent association between Helicobacter pylori 
infection and non-cardiac gastric cancer**'^. The ecological 
study in Taiwan suggests an association between 
Helicobacter pylori infection and gastric cancer. 
Helicobacter pylori infection in early childhood may be a 
key issue, and a long induction time is seemingly 
required for gastric carcinogenesis. High gastric cancer 
mortality areas were clustered in the aboriginal townships 
where Helicobacter pylori infection was prevalent 
The results of this study are similar to those of previous 
studies. Stomach cancer clusters for males and females 
showed clustering in Taiwanese aboriginal townships and 
also revealed new carcinogen clustering in the northern 
coastal region of Taiwan. This significant finding warrants 
further investigation. 

Conclusions 

A spatial autocorrelation calculation is useful for 
cluster mapping regional malignant neoplasms. Cluster 
mapping helps clarify issues such as the spatial aspects 
of both internal and external correlations of leading 
malignant neoplasms and spatial pattern differences. 
This method enables planners to assess spatial risk 
factors and to ascertain the most advantageous types of 
healthcare policies for planning and implementing 
healthcare services. These issues can greatly affect the 
performance and effectiveness of healthcare services. 
Therefore, cluster mapping is a clear approach to 
enhance the understanding of the results. 
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